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ABSTRACT 

The chemical abundance patterns of the oldest stars in the Galaxy are expected to contain residual 
signatures of the first stars in the early universe. Numerous studies attempt to explain the intrinsic 
abundance scatter observed in some metal-poor populations in terms of chemical inhomogeneities 
dispersed throughout the early Galactic medium due to discrete enrichment events. Just how the 
complex data and models are to be interpreted with respect to "progenitor yields" remains an open 
question. Here we show that stochastic chemical evolution models to date have overlooked a crucial 
fact. Essentially all stars today are born in highly homogeneous star clusters and it is likely that this 
was also true at early times. When this ingredient is included, the overall scatter in the abundance 
plane [Fe/H] vs. [X/Fe] (C— space), where X is a nucleosynthetic element, can be much less than 
derived from earlier models. Moreover, for moderately flat cluster mass functions (7 < 2), and/or for 
mass functions with a high mass cut-off (Mmax ^ 10^ M©), stars exhibit a high degree of clumping in 
C— space that can be identified even in relatively small data samples. Since stellar abundances can be 
modified by mass transfer in close binaries, clustered signatures are essential for deriving the yields 
of the first supernovae. We present a statistical test to determine whether a given set of observations 
exhibit such behaviour. Our initial work focusses on two dimensions in C— space, but we show that the 
clustering signal can be greatly enhanced by additional abundance axes. The proposed experiment will 
be challenging on existing 8-lOm telescopes, but relatively straightforward for a multi-object echelle 
spectrograph mounted on a 25-40m telescope. 
Subject headings: Galaxy - dwarf galaxies - stellar populations - star clusters - elemental abundances 



1. INTRODUCTION 

The Galactic stellar halo is a vast ancient repository 
that takes us back to a time when dark matter collapsed 
into bound structures and the Galaxy was seeded for the 
first time. Helmi (2008) and Tolstoy, HiU & Tosi (2009) 
provide excellent overviews of the many stellar systems 
and fragments that inhabit the halo. These include field 
halo stars (Christlieb et al 2002; Cayrel et al 2004; Frebel 
et al 2005; Cohen et al 2007), globular clusters (Graf- 
ton, Sneden & Carretta 2004), dwarf spheroidals (Mateo 
1999; Venn et al 2004), ultra-faint dwarf galaxies (Simon 
& Geha 2007; Kirby et al 2008), stellar streams (Ibata 
et al 1995; Chou et al 2010), stellar associations (Walsh 
et al 2007) and satellites to dwarf galaxies (Coleman et 
al 2004; Belokurov et al 2009). Already the chemical in- 
formation arising from the most metal-poor stars is very 
difficult to unravel (Nomoto et al 2005; Kirby et al 2008). 
We have barely begun to understand what these systems 
are telling us about the sequence of events that led to 
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the Galaxy (McWilliam, Simon & Frebel 2009; Freeman 
& Bland-Hawthorn 2002). 

The first stars were unique to their time. The first 
stellar generations changed the universe in many ways, 
in particular, the chemical properties and the equation of 
state of the intergalactic medium. But at present there 
are many unknowns. Did the first stars form in isolation 
or in groups? Were relatively few massive stars responsi- 
ble for reionization or was it triggered by the collective ef- 
fect of massive star clusters? Just what are the processes 
that govern star formation at extremely low metallicity? 
Is this exclusively the domain of the most massive stars, 
or can substantial intermediate and low mass stars form 
(Tsuribe & Omukai 2006, 2008; Clark, Glover & Klessen 
2008)? In other words, did stellar populations observable 
today exist before reionization (Tumlinson 2010; Okro- 
chov & Tumlinson 2010)? The first star clusters are of 
great interest in that they shed light on star formation 
processes in the early universe (Bromm, Coppi & Lar- 
son 2002; Abel et al 2002). There are few if any rehable 
constraints at the present time. 

One of the most interesting developments in recent 
years is the simultaneous measurement of many elemen- 
tal abundances for individual metal-poor halo stars or 
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groups of stars (e.g. Beers & Christlieb 2005). Some of 
these elements, but not all, exhibit intrinsic scatter in 
the abundance plane [Fe/H] vs. [X/Fe] that exceeds the 
measurement errors. While the scatter is particularly ap- 
parent in halo stars (e.g. Roederer et al 2009), evidence 
is now emerging that star-to-star abundance variations 
exist in dwarf galaxies as well (Fulbright, Rich & Castro 
2004; Koch et al 2008; Feltzing et al 2009). The ob- 
served scatter is likely to increase now that metal poor 
stars are now detected below [Fe/H] = -3 (Norris et al 
2009; Starkenburg et al 2010; Simon et al 2010; Frebel, 
Kirby & Simon 2010). 

This has led numerous researchers to argue that the 
scatter in [X/Fe] is a tracer of an ancient inhomogeneous 
medium (e.g. Audouze & Silk 1995; Ryan, Norris & 
Beers 1996; McWiUiam 1997; Ishimaru & Wanajo 1999). 
If this interpretation is correct, we would expect that a 
subset of these stars is telling us something fundamen- 
tal about the first stars and their yields. But not all of 
the stars are providing us with an unambiguous "read- 
ing" of the early enrichment of the primordial interstel- 
lar gas. For example, a significant fraction of extremely 
metal poor stars appear to have undergone mass transfer 
with a companion (Ryan et al 2005; Lucatello et al 2005) 
which undermines any attempt at inferring the progeni- 
tor yields for elements such as CNO and a elements. Bi- 
narity may partly explain why the elemental abundances 
of the most metal poor stars defy a clear explanation at 
the present time (Joggerst et al 2010; McWiUiam et al 
2009). 

We now incorporate a missing ingredient into existing 
models of stochastic chemical evolution. In the present- 
day universe, most stars are born in a single burst within 
compact clusters and stellar fragments, rather than in 
isolation. This fact is well established in the local uni- 
verse (Lada & Lada 2003) and is likely to be true at 
the time of the first stars (Clark et al 2008). Cluster 
formation has an important consequence for the distri- 
bution of stars in the abundance plane. De Silva ct al 
(2006, 2007a,b) have shown that both old (- 10 Gyr) 
and intermediate-age {'^ 1 Gyr) open clusters are chem- 
ically homogeneous to a high degree (A[Fc/H] < 0.03 
dex). The open cluster Tombaugh 2 was thought to be a 
rare example of a chemically inhomogeneous open cluster 
(Frinchaboy et al 2008), but this is contradicted by a new 
study that finds the stellar population to be highly ho- 
mogeneous (A[Fc/H] < 0.02 dex; Villanova et al 2010). 
Apart from a few light elements, the same holds true for 
globular clusters (Gratton et al 2004), although some sys- 
tems (e.g. wCen) show evidence for more than one burst 
of star formation. Interestingly, even moving groups can 
show the same signature of chemical homogeneity (De 
Silva et al 2007a; Chou et al 2010; Bubar & King 2010). 

Bland-Hawthorn, Krumholz & Freeman (2010) pro- 
vide a condition that ensures chemical homogeneity in 
a young star cluster based on the surface density of the 
progenitor gas. They show that chemical homogeneity is 
expected in open clusters, globular clusters and plausibly 
clusters more massive than IO^Mq. This process has not 
been incorporated into stochastic chemical models to date. 
Once this is done, we arrive at an important insight that 
will enhance the interpretation of metal abundance dis- 
tributions in individual stellar populations. The effects 
that we highlight can be searched for in existing and in 



future surveys, as we show. 

A key assumption in our present work is that present- 
day dwarf galaxies are important sites for establishing 
the yields of the first stars and star clusters. This needs 
some clarification. While it is likely to be true that the 
most efficient way to identify metal-poor stars is to target 
dwarf galaxies, a low value of [Fe/H] is no guarantee that 
a star is ancient since it may simply reflect environmental 
conditions (e.g. low star formation efficiency, weak grav- 
ity field) . Published numerical simulations are unclear on 
whether the most ancient stars are solely the preserve of 
the inner bulge (White & Springel 2000; Bland-Hawthorn 
& Peebles 2006) or spread over the entire Galaxy (Scan- 
napieco et al 2006; Brook et al 2007). But it is now well 
established that the bulge, the halo and all dwarf galax- 
ies comprise stellar populations that are 10 Gyr or older, 
equivalent to a formation redshift of z > 2 (e.g. Tolstoy 
et al 2009) . The relative fractions of dwarf populations 
that formed before, during or after the reionization epoch 
is an open question. We are therefore at liberty to ex- 
plore new observational constraints on the nature of the 
first stars and star clusters. 

In §2, we present evidence for homogeneous star clus- 
ters in the local universe and argue that the same should 
be true for their high-redshift counterparts. In §3, we 
briefly outline the inhomogeneous chemical evolution 
models that have been developed to date. In §4, we in- 
troduce a revised stochastic model of star formation in 
dwarf galaxies that incorporates the 'homogeneity' condi- 
tion described by Bland-Hawthorn et al (2010), and show 
the predictions of the revised model in view of the earlier 
work. In §5 and in the Appendix, we introduce cluster 
finding algorithms for both large and small data sets, 
and demonstrate how the effects of clustering may al- 
ready be visible in existing observations. We also explore 
the longer term prospect offered by a multi-object echelle 
spectrograph on an extremely large telescope (ELT). The 
conclusions are presented in §6. 

2. HOMOGENEOUS STAR CLUSTERS 

Before revisiting stochastic chemical evolution mod- 
els, we review the key arguments presented in Bland- 
Hawthorn et al (2010) on the conditions under which star 
clusters are expected to be highly homogeneous in most 
elements, as observed for local star clusters (§1). We then 
extend these arguments to clusters at low metallicity. 

As a star-forming cloud assembles, turbulent di ffusion 
within it will ho mogenize its chemical composition ( Mur- 
ray fc Lin|1990 ) . For clouds whose turbulent motions are 
primarily on l arge scales (as is t he case for all local molec- 
ular c l ouds -|Heyer k Brunt I ( |2004) ), Bland-Hawthorn 
et al. (2010) show that the time required tor this pro 



cess to smooth out a composition gradient on the size 
scale of the cloud is roughly t^ = L/a, where icr is the 
cloud crossing time, L is its size, and cr is its velocity dis- 
persion. Smaller-scale gradients are erased more quickly, 
with the diffusion time varying as the square of the (nor- 
malized) characteristic size scale. Since tcr is comparable 
to or smaller than the timescale over which star forma- 
tion takes place, clouds will homogenize as they assem- 
ble, and will be pushed away from homogeneity only if 
supernovae occur during the star formation process, be- 
fore the cloud disperses. The time required for a very 
massive star to evolve from formation to explosion de- 
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fines the supernova time scale, tsN ~ 3 Myr, and we 
only expect star clusters to be homogeneous if they are 
assembled on time scales shorter than isN- 

To determine under what conditions this requirement 
is satisfied, we must compare isN to the cluster forma- 
tion timescale tform- There is considerable debate over 
this timescale, so for our purposed we will adopt the 
longest, most conservative proposed timescale of 4 icr 
(Tan, Krumholz & McKee 2006). Since 



tc 



0.95 



V^virG V^^/ 



M\ 



1/4 



(1) 



for a cloud of mass AI, column densit y S, and virial ratio 



Qyii-; observed clouds have avh- « 1.5 (McKee & Ostriker 



2007| . For convenience we write the final stellar mass 
of a cluster as M* = eM, where e is the star formation 
efficiency. Both observational and theoretical arguments 
suggest e « 0.2 independent of M (Lada & Lada 2003; 
Fall, Krumholz & Matzner 2010). Thus the condition 
that tform ~ 4icr < 2tsN (whcrc the factor of 2 arises 
because the typical star forms halfway through the for- 
mation process) is satisfied only if 



Mli'j:-'/' < 2.8, 



(2) 



where M*,5 = M^/{1Q^ AIq) and Eq = S/(l g cm^^)^ 
and we have adopted fiducial values of e = 0.2 and 
avir = 1.5. Star- forming regions within the galaxy have 
So ~ 1 g cm^^ independent of mass (Fall et al. 2010). 
Globular clusters today have somewhat higher values of 
E, although it is unclear if this reflects the conditions 
under which they formed, or is the result of dynamical 
evolution since their formation. Regardless, this analy- 
sis suggests that clusters with masses up to ^ 10^ Mq 
should be chemically uniform and maybe much higher 
if systems form with the nuclear densities observed in 
today's globular clusters (Bland-Hawthorn et al 2010). 

There are few observational constraints on the exis- 
tence of star clusters in extremely metal poor gas. In 
the nearby universe, clear evidence for star clusters at 
[Fc/H] « -1.7 is observed in the most metal-poor, blue 
compact dwarf galaxy I Zw 18 (Izotov & Thuan 2004). 
Globular clusters are known to exist down to [Fe/H] « - 
2.4 (Gratton et al 2004). We now argue that the above 
estimate for the uniform mass limit is likely to apply at 
metallicities as low as [Fc/H] ^ -5. 

Low metallicity can change the star formation process 
in two ways that are relevant to the chemical signatures 
of the resulting stars. First, a change in metallicity can 
affect the way star- forming clouds fragment; if there is no 
fragmentation down to sub-solar masses below a certain 
metallicity, then no stars below that metallicity will sur- 
vive to the present day. Fragmentation of low metallic- 
ity gas has received extensive attention in the literature, 
which we will only summarize here. The main point rel- 
evant for our purposes is that, when dust cooling is con- 
sidered, gas is able to fragment to sub-solar masses even 
at metallicities at low as ~ 10~^ of the solar abundance 
(Clark et al. 2008; Schneider & Omukai 2010). Once 
fragmentation is possible, turbulence naturally generates 
a mass spectrum of fragments with a slope comparable to 
the Salpeter slope dn/dm ex m~'^-^^ (Padoan & Nordlund 
2002; Hennebelle & Chabrier 2008), and the properties 
of the turbulence do not depend on the redshift or the 



metallicity. Alternately, Clark et al. (2009) have pro- 
posed that competitive accretion processes would gen- 
erate a universal mass spectrum, although doubts have 
been raised about whether this process in fact operates 
on both theoretical (Krumholz, McKee & Klein 2005) 
and observational (Andre et al. 2007) grounds. In either 
case, however, we expect there to be some sub-solar mass 
stars formed that can survive to the present day. 

Given that small stars form at low metallicity, we can 
then ask whether our conclusions about cluster chemical 
homogeneity will continue to apply in this regime. A fail- 
ure of homogeneity could occur either if clouds did not 
homogenize during star cluster formation, or if the cloud 
properties changed such that the cluster formation time 
became longer than the supernova timescale. The former 
is unlikely because the homogenization time is compara- 
ble to the crossing time, and it is implausible that any 
star formation process could take place in less time. The 
latter would require that protocluster gas clouds all have 
surface densities significantly below ^ 0.1 g cm~^ which 
is highly unlikely to be true. However this would imply 
that the star-forming clouds had surface densities at or 
below the mean surface density of observed high redshift 
galaxies (e.g. Genzel et al. 2006), which is implausible. 
We therefore conclude that homogeneity should continue 
to apply as well.'^ 

3. EXISTING INHOMOGENEOUS CHEMICAL 
EVOLUTION MODELS 

3.1. A statistical treatment 

Many authors have attempted to explain the declining 
scatter in elemental abundances with increasing [Fc/H] 
in terms of star formation within an ISM that is enriched 
by a succession of SN events (Ishimaru & Wanajo 1999; 
Shigeyama & Tsujimoto 1998; Raiteri et al 1999; Argast 
et al 2000, 2004; Wasserburg & Qian 2000; Karlsson & 
Gustafsson 2001, 2005; Qian 2000, 2001; Travagho, Galh 
& Burkert 2001; Fields, Truran & Cowan 2002). In the 
[Fe/H] vs. [X/Fe] plane,"* the scatter in [X/Fe] is due to 
two or more classes of SN that produce distinct yields of 
[X/Fe]. In these stochastic models, the scatter converges 
roughly quadratically to a mean value of [X/Fe] given by 
the supernova yields weighted by the initial mass func- 
tion (IMF). In its simplest form, this behaviour is driven 
by the number of SN events (usn), such that [Fe/H] = 
[Fe/H]niin + logngN, where [Fe/H]i„in is the minimum al- 
lowed metallicity. In other words, if [Fe/H],„i,i — —5 
consistent with the most metal-poor stars to date (e.g. 
Frebel et al 2010), it takes lO'^ supernovae to enrich a gas 
parcel to solar abundance. An example of this behaviour 
is presented in Bland-Hawthorn & Freeman (2004; Fig. 
3) where the number of SNe are indicated. 

•^ Homogeneity could also fail in very massive clusters, those 
larger than ~ 10'' Mq (Maraston et al 2004; Larsen 2009). How- 
ever in this case even if the whole clusters were not chemically ho- 
mogeneous, smaller mass portions within them would be, and there 
is little distinction from the standpoint of the model we present 
below. The effect would simply be to break up very large clusters 
into a number of chemically homogeneous small clusters. Given 
that large clusters are rare, this would not affect our signal signif- 
icantly. 

* In the standard notation, for a given star with a measured 
abundance ratio (X/Fe)», it is convenient to write [X/Fe] = 
log^Q(X/Fe)« — logj^Q(X/Fe)Q which is a logarithmic scale normal- 
ized to solar abundances. 
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If the interpretation of declining inhomogeneity is 
broadly correct, we can hope to learn about the yields 
of the first SNe responsible for the enrichment (Fields, 
Truran & Cowan 2002). In other words, if the scatter 
in [X/Fc] is due to a high-yield source (class A) and a 
low-yield source (class B), an individual star with known 
[Fe/H] has a fraction of Ja = Jt-a/^^sn progenitors and 
/b = J^s/^SN progenitors of a total progenitor popula- 
tion of risN = "■A + ns supernovae. We can simulate 
this with a random variable j = j a drawn from the beta 
distribution function. 



e(/;«,^) 



r(Q + /?) 
r(a)r(/3) 



/""'(I-/) 



/3-1 



(3) 



defined for / e [0, 1] such that /„ ^df — 1 and where F is 
the gamma function. The quantities a and /? describe the 
yields of the two populations, such that a = /a('T-sn — 1) 
and /3 = (1 — /a)('^sn ^1)- The mean, variance and 
higher moments of the normalised distribution ^(/) de- 
pend only on a and /?. 

Equation [3] has several remarkable properties that are 
useful for describing the declining influence of chemical 
inhomogeneities in the early ISM. The_mean is given 
by /U = a{a + l3)^^ = Ja such that Ja is the mean 
of the distribution ^{/a)- The variance is given by 
(j^ = a(3{a + (3)^^{a + (3 + 1)^^ which leads to a scatter 
that declines as a ex rig^'^ as expected^. Even initially 
skewed distributions in ^ converge to the normal distri- 
bution (/i = /a) in the limit of high nsN, as expected 
from the central limit theorem. 

Four realizations of the beta distribution (i.e. no clus- 
tering) are presented in Fig. [T| a total of n* = 3000 
points was used for each simulation distributed in [Fe/H] 
according to Fig. [2] (see §5). The log^ distribution can 
be renormalized trivially to match a given set of [Fe/H] 
vs. [X/Fe] observations (e.g. Fields et al 2002). The 
beta distribution ^ is everywhere continuous in /^, and 
therefore /b, except in the limit of small ha or ng. This 
recognizes the fact that, as stated by Fields et al (2002), 
"a given parcel of ISM gas and dust can be enriched to 
different degrees by the ancestors it had." But in the limit 
of small ua or ub, discreteness effects may exist. Stars 
with very few prior enrichments, if they can be identi- 
fied, provide crucial information on the progenitor yields 
of class A or B sources (e.g. Shigeyama & Tsujimoto 
1998; Karlsson & Gustafsson 2001; Ballero et al 2006), 
but within the domain of the statistical model, these are 
rare events and may be difficult to find unambiguously. 
This is an issue we return to in §5. At high [Fe/H], when 
new sources of metals become dominant (e.g. type la 
SNe, asymptotic giant branch stars [AGB]), the simple 
description in equation [3] breaks down. 

To recap, the model described by equation |3] assumes 
that a star with a given [Fe/H] has been enriched to dif- 
ferent degrees by its type A and type B ancestors which 
is considered to be a continuous rather than a discrete 
process. Equation |3] is used to generate a distribution 
of possible values of Ja at a constant value of [Fe/H] or 
equivalently nsN- More complex models show that there 
is generally no simple relation between Ja and [Fe/H] 
(e.g. Qian 2001; Bland-Hawthorn & Freeman 2004). 

5 Note that in the abundance plane, (T([Fe/H]) oc 10~''-5P<=/H1. 



We note already from Fig. [T] that spurious groupings 
(false positives) are inevitable, particularly when the 
data points are convolved with typical errors of 0.1 dex 
psf for differential abundance analysis. The importance 
of the beta distribution in equation [3] is that it enables us 
to efficiently generate large numbers of unbiassed realiza- 
tions (> 10'^) of the theoretical abundance plane, some- 
thing that is infeasible with full-blown chemical mod- 
elling. These are required to calibrate the statistical 
properties of our group- finding algorithm in §5. This 
step is necessary if we are to identify significant groups 
against a rapidly changing background, as observed in 
Fig. [1] Clusters at low [Fe/H] will have a higher level of 
signincance than abundance groupings at higher [Fc/H] 
(cf. Karlsson et al 2008) and we need to be able to iden- 
tify groups with smaller number statistics. 

But first we must consider more complex treatments of 
stochastic chemical evolution that will allow us to include 
the effects of clustering during star formation. 

3.2. Inhomogeneous stochastic model 

The chemical evolution model that most resembles our 
new work is Argast et al (2000, 2004). Their model was 
computed over a volume (2.5 kpc)'^ for a uniform gas 
mass of IO^Mq at a resolution of (50 pc)^, such that 
they were sensitive to chemical inhomogeneities on mass 
scales as low as lO'^M©. A distinct feature of the Ar- 
gast model is that star formation mostly occurs in the 
expanding shell of material swept up by the supernova 
shock front. They make the simplifying assumption that 
[X/Fe] is determined by the SN yield such that [Fe/H] 
is a combination of the SN yield and the swept-up ISM, 
and that the shell is everywhere fully mixed. 

There are several problems with this assumption. 
First, as far as we are aware, there is no compelling evi- 
dence for star formation occurring in supernova remnant 
shells. The closest one gets are some studies of the Large 
Magellanic Cloud and the Galaxy (Yamaguchi et al 1999, 
2001) which conclude that maybe ~10% of star forming 
regions today appear to be triggered by local supernova 
events. There is a clear distinction that must be made. If 
the star formation occurred in collapsing massive clouds 
triggered by the passing shock wave, rather than in the 
swept up shell, we would still expect these star clusters 
to be highly homogeneous. 

Secondly, core-collapse supernovae are inherently 
asymmetric due to the nature of the explosion mechanism 
due to stellar rotation, magnetic fields, and so on (Wang 
& Wheeler 2008). Asymmetric ejections from these su- 
pernovae are well supported by observations (q.v. Maund 
et al 2009). Therefore, the remnant shells are highly un- 
likely to be chemically homogeneous, again supported by 
x-ray observations of nearby supernova remnants (Wang 
& Wheeler 2008). Asymmetric enrichment would have 
led to even larger spread in the elemental abundances, 
thereby invalidating their model comparisons. 

Thirdly, our new homogeneity condition, supported by 
observation, is essentially independent of the original gas 
distribution. This constitutes a smoothing scale in mass 
below which metallicity variations cannot occur, thereby 
suppressing the amount of scatter observed in the abun- 
dance plane (cf. Argast et al 2004). 

We now introduce our revised chemical evolution 
model which incorporates the onset of chemically homo- 
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geneous star clusters. 

4. REVISED CHEMICAL EVOLUTION MODEL 

4.1. Initial cluster mass function 

In order to derive the impact of star clusters on the 
abundance plane, we must consider the progenitor mass 
distribution of star clusters. It is now well established 
that star clusters have a range of masses that extend from 
a maximum mass (Mmax) to a minimum mass (Mmin). 
The form of the birth distribution is known as the initial 
cluster mass function (ICMF) and is assumed to have the 
form 

dN/dM, = xiM*) = X0M7''. (4) 

The observations may support a universal slope of 7 « 2 
in all environments, i.e. equal mass per logarithmic bin 
(Fall, Chandra & Whitmore 2005, 2009; Lada & Lada 
2003; Elmegreen 2010). In this picture, the only pa- 
rameter that does appear to vary at all between galaxies 
today is Mmax, which can be represented schematically 
as a Schechter function-like cutoff in the ICMF, although 
there is no good reason to prefer this functional form over 
a simple truncation. 

A power law is logarithmically divergent at both low 
and high masses, so it must be truncated somewhere. At 
low masses, the truncation is due to the discreteness of 
stellar masses - the smallest cluster is simply one star. 
At large masses, there must also be a truncation. For 
this reason, and because the molecular cloud mass func- 
tion (as opposed to the cluster mass function) is observed 
to have a non-trivial truncation, there is likely to be a 
maximum cluster mass, and that it varies depending on 
galactic environment. In disk galaxies, a possible expla- 
nation is that the truncation mass is of order the Toomre 
mass in the galactic disk (Toomre 1964; Escala & Larson 
2008), which is ~ IO^Mq for the Milky Way, but is signif- 
icantly larger for present-day starburst/merger galaxies 
(Larsen 2009) and their high-redshift counterparts (Gen- 
zel et al 2006; Forster-Schreiber et al 2009). The situ- 
ation in spheroidal galaxies is much less clear but the 
common occurrence of globular clusters indicates that 
massive cluster formation must take place (Elmegreen 
2010). 

4.2. Stochastic chemical evolution 

We take as our starting point the stochastic chemical 
enrichment model presented in Karlsson (2005, 2006) and 
Karlsson et al. (2008). This model is now updated to 
include the homogenizing processes that must occur dur- 
ing the formation of star clusters (§2). But in order to do 
this, two additional mixing processes must be accounted 
for. First, since stars within clusters show no evidence 
of scatter in chemical abundance ratios, the gas involved 
in the formation of a cluster must be homogenized prior 
to, and stay homogenized during, the formation of the 
individual stars. We include a mixing process to ensure 
that this is always true during the cloud collapse and 
cluster formation phase. Second, since massive stars are 
short lived (r^ < 20 Myr) they will explode as super- 
novae (SNe) before the cluster has dispersed. Assuming 
that the cluster is unbound and that the intrinsic veloc- 
ity dispersion of the stars in the cluster is ~ 1 km s~^, 
massive stars will, on average, be ~ 20 pc apart when 
they go off as SNe. This is less than the typical size 



(^ 100 pc) of a SN remnant (SNR) as it merges with the 
ambient medium, (e.g. Ryan et al. 1996). Therefore, 
we assume that the ejecta of all SNe formed within a 
single cluster enrich the ISM collectively with a mixing 
mass that scales linearly with the total energy output of 
the clustered SNe. These two processes, which produce 
additional averaging of newly synthesized material be- 
fore a new generation of stars is formed, have not been 
considered in earlier models. 

The mixing volume, V^^^, of each chemical enrichment 
event is given as a power-law expression. The present 
model is somewhat simplified as we suppress the con- 
tinuous mixing due to the bulk random motions in the 
turbulent ISM. Without the time-dependent turbulent 
diffusion term (cf. Karlsson et al. 2008, equation 1), 



Vfnix reduces to 



47r 



Kix(fc) = y'^E(fc)^/' - Mdii(fc)/p, 



where 






2/3 



(5) 



(6) 



p is the density of the ISM and Mdii(A:) is the dilution 

mass of the ejecta of a number k (= usn) of SNe as 

they merge with the ambient medium (e.g. CiofR, McKee 

& Bertschinger 1988). Here, we assume that the total 

dilution mass for the ejecta of multiple SNe exploding in 

a cluster is proportional, on average, to the total energy 

released by the SNe associated with that cluster, such 

that 

k 

Mdii(fc)-^M,^^, (7) 

j=o 

where M^^ is the mass swept-up by individual SNe as- 
suming it explodes in isolation. In order to introduce 
a small amount of randomness, log(Msw) is drawn from 
a normal distribution centred on log(Msw) = 5 with a 
width of 0.25. Note that this mixing process is denoted 
a single enrichment event, even though it may involve 
enrichment by multiple SNe. 

In addition to this mixing, there is also a mixing vol- 
ume associated with the formation of the cluster during 
the collapse of the molecular cloud, V^ 



k'^;^, such that 



VL. - M/p, 



(8) 



where M is the mass of the molecular cloud associated 
with the star cluster. Within this volume, everything is 
assumed to be thoroughly mixed before stars are allowed 
to form. 

In our models, we explore the range 1 ^ 7 ^ 2.5 
(e.g. Kroupa & Boily 2002), between clusters of mass 
(Mmin, ^max) = (5 Mq, 5 X lO'^ Mq ) , which is our fidu- 
cial mass range (Larsen 2009; Portegies Zwart et al 2010). 
We adopt a simple scaling relation between the cluster 
mass and the mass of the parent molecular cloud, such 
that M — M^,/e, where we adopt a star formation effi- 
ciency of e = 0.2 (see §2). Similarly, the stellar initial 
mass function (IMF) is governed by 

dn/dm — (j){m) — (j)()m~". (9) 

For simplicity, a = 2.35, which recovers the Salpeter 
IMF. The range of stellar masses is set to 0.1 < iti/Mq < 
100 and (t)o ^ 6.03 x 10"^ M©^-^^ 



6 



Bland-Hawthorn, Karlsson, Sharma, Krumholz, & Silk 



Formally, the average number of clusters contributing 
to the chemical enrichment in a random point in space 
can be expressed by the parameter /io, here given by 

t 



Mc(t) 



fcmax 

E 

fe=0 



flfc X ^mix(^)"cl(i')dt' 



(10) 



where a^ = -^/c/X]fc=cr ^^ is the fraction of clusters in 
which k SNe explode, while Mci(i') is the formation rate 
of star clusters, closely related to the star formation rate 
and V"^i^(fc£)) = 0. The value of /cmax is set by the IMF 
and the upper mass limit of star clusters. On average, 
270 massive stars explode as SNe in a cluster of mass 
5 X 10'* M0. Such a cluster will not form more than 320 
SNe at the Scr-level. We set fcmax = 400 to be on the safe 
side. 

Now, if we make the simplifying assumtion that star 
clusters are randomly distributed in space (i.e., not them- 
selves clustered), the probability of finding a region in 
space enriched by k events, i.e., k clusters producing one 
or more SNe, at time t is given by the Poisson distribu- 
tion 

(11) 



P(AC,Aie(t))=e-''<=(*Vc(t)7« 



To follow the chemical enrichment in a typical dwarf 
galaxy, we assume a simulation box of 1.6 kpc on a side, 
with an initial, constant particle density of np = 1 cm~^, 
corresponding to a gas density of po — 2.06 x 10~^^ g 
cm~^. The initial mass of baryons in the box is thus 
3.1 X 10^ M0 which is sufficient to make the largest star 
clusters considered in this work. The initial metallicity is 
set to Z — 0. In the box, star clusters are allowed to form 
from collapsing molecular clouds. The molecular clouds 
are distributed randomly within the box and the masses 
of the corresponding clusters are distributed according to 
equation Q. During the collapse of a molecular cloud, 
the volume, V^i^, of gas corresponding to the mass of 
the cloud is made chemically homogeneous. The stars 
of the cluster will all have chemical abundances equal 
to those of the parent cloud. The number of massive 
stars, k, in the cluster that will explode as SNe is, again, 
determined by the Poisson statistics P(fc,/zsN), where 
the mean number of SNe in a cluster of a given mass is 
given by 

MSN = eM/sn/w, (12) 

where e is the star formation efficiency, M is the mass of 
the molecular cloud, /sn = 1-9 x 10~^ is the fraction of 
massive stars exploding as SNe, and m = 0.35 Mq is the 
mean stellar mass in a single stellar population. The last 
two parameters are determined by the IMF. For large 
values of /isN, the Poisson distribution approaches a nor- 
mal distribution. We make use of this fact to simplify 
the calculations for hsn > 32. In clusters where one or 
more SNe are formed, the ejecta of all SNe, with masses 
distributed according to equation [91 are homogeneously 
mixed with the metals already present within the vol- 
ume V^ix- The Fe-core collapse SN yields, in particular 
of Ca and Fe, are taken from Nomoto et al. (2006) while 
the yields of Eu, representing r-process elements and as- 
sumed to be formed in 0-Ne-Mg core collapse SNe in the 
mass range 8 < m/M© < 10, are taken from Argast et 
al. (2004). In order to match the simulations with ob- 
servations of metal-poor stars in the Galactic halo, the 
yields of Eu are doubled. 



In order to account for the amount of mass locked up 
in low-mass stars and stellar remnants and the mass lost 
due to star formation driven galactic outflows, a fraction, 
/out = 0.46, of the mass of each star-forming molecu- 
lar cloud is subtracted from the total mass of the sys- 
tem. The gas density is decreased accordingly. Infall and 
mass lost via interaction with the surroundings, such as 
tidal and ram-pressure stripping are not considered here. 
When the total mass and gas density of the system has 
been re-calculated, the next cycle begins with the forma- 
tion of a new cluster. Since our focus is in the metal-poor 
regime, the yields of type la SNe and AGB stars are not 
considered here. 



4.3. Results 

The results of the clustered simulations for [Ca/Fel and 
Jr/Fe] with respect to [Fe/H] are presented in Figs. l3]and 
H] respectively. For all models, the number of simulated 
stars is Tl^, ^ 10^ equivalent to a stellar mass of 3 x 10^ 
M0 and a luminosity of 10^ L© assuming a Salpeter IMF; 
the luminosity is two times higher for a Kroupa IMF. 
To begin with, we do not consider the effect of distance 
on the observed number counts; this is treated in §5.2. 
The modelled number of stars is much larger than we 
can expect to obtain in nearby dwarfs, but the results 
provide sufficient resolution to understand the impact of 
the ICMF parameters. The overall distribution of the 
stars with metallicity is shown in Fig. [2] 

There is a clear progression moving from high 7 to low 
7 in the occurrence of clustered abundance signatures. 
The high 7 limit is indistinguishable from the beta (non- 
clustered) models. This is not unexpected since, in the 
limit of 7 = 2.5, almost all stars are formed in small star 
clusters such that unique groupings in abundance space 
are poorly represented. 

As 7 decreases, the clustering increases markedly but 
at the expense of the dispersion in [X/Fe] . In the limit 
of 7 = 1, the vertical spread has vanished over all [Fe/H] 
with little or no abundance spread in [X/Fe]. While 
7 = 1 is smaller than what is observed, it is in fact a 
useful surrogate for demonstrating the impact of a higher 
maximum cloud mass Af^ax for larger 7 values. For ex- 
ample, in Fig. [5] we show the result of running 7 = 1.5 
and 7 = 2.0 models with Af^ax ~ 10® Mq. We see how 
the high mass cut-off at high 7 mimics the behaviour of 
a lower value of 7: the vertical scatter is greatly reduced, 
and groupings are seen to extend to lower [Fe/H]. 

In Fig. [6j we show the cumulative fraction of stars and 
the integrated light as a function of stellar absolute mag- 
nitude for the model dwarf galaxy. In Table 1, we give 
a rough breakdown of how many stars are expected for 
the dwarf galaxy as a function of stellar apparent mag- 
nitude, distance and metallicity. We consider V=16 and 
V=18 to be the bright and faint limit of an 8m class ex- 
periment; these magnitude brackets become V=20 and 
V=22 on an ELT (Bland-Hawthorn et al 2010). 

In the next section, we analyse these simulations using 
a group finder in order to provide an objective assess- 
ment of the amount of measurable clustering in the limit 
of high and low n* that we can expect. We apply this 
analysis for different [Fe/H] cut offs since, as we have 
seen, the abundance plots become crowded as [Fe/H] in- 
creases. 
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5. CLUSTERING IN C-SPACE: A STATISTICAL 
TREATMENT 

5.1. Group finding for large n^ 

In order to investigate the clustering in abundance 
space (defined by [Fe/H] and [Ca/Fe] or [r/Fe] abun- 
dances), we use the density based hierarchical c luster- 
ing algorithm EnLink ( jSharnia fc Johnston |2009 ). The 
method is statistically objective and robust m its appli- 
cation; the method is highly efficient and does not use 
pixellation or binning. 

EnLink is based on the fact that a system having more 
than one group in a data set will have peaks and valleys in 
the density distribution. A peak in a density distribution 
identifies a cluster and the set of points which can "climb 
the peak," identified by following density gradients, are 
labeled as its members. The valleys in between the peaks 
represent intersections between the clusters. These are 
used to define the boundaries of the clusters and also to 
form a parent-child relationship, thereby establishing a 
hierarchical organization of the groups. EnLink works 
by first calculating the densities of the data points using 
a set of qden nearest neighbours and then organizes the 
points in groups by using nearest neighbour links. 

We present our results in the Appendix for both 
[Ca/Fe] and [r/Fe] for different values of 7 over a wide dy- 
namic range in the observed number of data points. We 
confirm that statistically significant groupings in abun- 
dance space can be recovered, particularly at low values 
of 7. For this work, we adopt a low cluster-mass limit of 
.^^min — 5 M0 which is likely to be too conservative. For 
a fixed number of simulation particles, decreasing M-^^-^ 
suppresses the number of detected groups. 

5.2. Group finding for small n^, 

Clustering should be present even in the limit of only 
a few data points. To emphasize this fact, we have sim- 
ulated the abundance measurements for a dwarf galaxy 
at a distance of 30 kpc (see Figs, [s] and [o]) as observed 
on 8m class and 30m class telescopes respectively. We 
adopt a stellar mass of 3 x 10^ Mq typical of a faint 
dwarf galaxy. This object has about 10^ stars, a lumi- 
nosity of 10^ Lq and an absolute V mag of My — —7.6 
assuming a Salpeter IMF. The luminosity is a factor of 
two higher for a Kroupa IMF. The star counts are con- 
sistent with the model values in Table 1 when scaled to 
the adopted lower mass. 

For the 8m experiment (Fig. ^), we assume measure- 
ment errors of 0.1 dex in both~[r/Fe] and [Fe/H]. For 
7 = 1.5 and 7 = 2.0, the effects of clustering are evident 
and this holds true if we double the measurement error. 
The impact of this measurement uncertainty over the full 
simulation is shown in Fig. [Tj We conclude that, once 
more extensive studies are made of the nearest dwarf 
galaxies, the effects of clustering may become evident 
even before the advent of ELTs. We envisage projects 
which focus on relatively few stars that appear grouped 
in abundance space in low resolution spectroscopic data. 
Long integrations and differential analysis will allow the 
null hypothesis to be tested that the stars have identical 
abundances in all elements. 

For the 30m experiment (Fig. ^, we have assumed 
a general improvement in the atmospheric models and 
the experimental errors, and therefore adopt errors of 



0.05 dex. The effects of clustering, which are easily seen, 
remain clearly visible even after a twofold increase in 
the measurement errors in both axes. This simulation is 
a powerful statement of the importance of multi-object 
echellcs on ELTs. 

For a fixed qden Sind data dimensionality, the number 
of spurious groups due to Poisson noise increases linearly 
with the total number of data points n^,. Therefore, 
EnLink can be used in the limit of small n^ with qden 
set to 3, i.e. a minimum of one more than the number 
of data dimensions. But with so few data points, the 
full power of EnLink is not being exploited such that 
more rudimentary statistical techniques may be better. 
However, EnLink is particularly efficient in treating data 
sets with more than two dimensions, even in the limit 
of small n^.The significance of groups increases dramat- 
ically when we apply EnLink on an abundance space 
with more dimensions. The normalizing distribution in 
equa tion [3] is easily extended to higher dimensions. In 
Fig. |A6[ we apply EnLink to the 3D C-space ([Fe/H], 
[Ca/l'te] , [r/Fe]) for three different measurement errors 
(0.05, 0.1, 0.2 dex) typical of contemporary observations 
on 8m telescopes at V=18. Even in the presence of large 
errors, clustering signals are seen in all cases. 

6. DISCUSSION 

This paper has explored the prospect of probing the 
mass scales of the first star clusters. We stress that we 
have used a lower cluster-mass limit of Mj^in = 5 Mq 
(typically 10 stars) which is very conservative. A more 
reasonable value may be an order of magnitude higher 
(cf. Bland-Hawthorn et al 2010). For 7 = 2 and a fixed 
number of simulation particles, the lower threshold has 
the effect of reducing cluster membership by a factor of 
ten, and increasing the "background" by the same fac- 
tor. This lowers the overall clustering signal by an order 
of magnitude. But we adopt the conservative lower mass 
limit in order to account for a possible dwarf stellar pop- 
ulation that are not born in clusters. We were unable 
to find any observational constraints on the diffuse vs. 
clustered population in dwarf galaxies (cf. Lada & Lada 
2003). 

In our conservative analysis, we find that there is an in- 
timate connection between properties of the ICMF and 
the amount of potentially detectable clustering in the 
abundance plane. A flat ICMF and/or a ICMF with a 
high mass cut-off produces strong clustering in the [Fe/H] 
vs. [X/Fe] abundance plane (C-space). While our mod- 
els are inevitably oversimplified, the phenomenon should 
be detectable on 8-lOm telescopes (e.g. Figs, [s] and l9|. 
This gains support from existing observations of open 
clusters, globular clusters and moving groups (Castro et 
al 1999; Shen et al 2005; Randich et al 2006; Sestito 
et al 2007; De Silva, Freeman & Bland-Hawthorn 2009; 
Chou et al 2010; Bubar & King 2010). A clean "cluster- 
ing" signature in C-space, particularly at low metallicity, 
is important for a number of reasons. First, it indicates 
the presence of massive star clusters in the early universe 
and conceivably provides a constraint on the mass of the 
first systems. Second, it provides a clean signal of the 
progenitor abundances in the cloud prior to cluster for- 
mation. This abundance measurement is averaged over a 
substantial amount of gas and is therefore not subject to 
mixing anomalies (Karlsson & Custafsson 2001) or mass 
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transfer in binaries (e.g. Suda et al 2004; Ryan et al to the BIPAC Institute, Oxford for their hospitality. 

2005; Lucatello et al 2005). 

Strong clustering would indicate a highly flattened 
ICMF, or a high mass cut-off. This could herald the 
onset of the formation of massive star clusters in dwarf 
galaxies (e.g. Bromm & Clarke 2002). If the star forma- 
tion efficiencies were low at that time, this may require 
supermassive gas clouds (> 10^ ^0) to have formed even 
at the earliest times (Abel et al 2000), possibly consis- 
tent with the regular occurrence of massive star-forming 
clumps at high redshift (Genzel et al 2006; Forster- 
Schreiber et al 2009; Elmegreen & Elmegreen 2006). 
Conversely, if such clustering was not observed, then we 
would infer that the slope of the early ICMF is steep, or 
the maximum cluster size is relatively small compared to 
the present day. But the observed scatter would need to 
be consistent with the non-detection of clustering. 

In future investigations, from the perspective of chem- 
ical signatures, we will look at the degeneracy between 
steep ICMFs with a high mass cut-off and flatter ICMFs 
with a lower mass cut-off (see §4.2). We will also look 
at a wider class of chemical elements that are considered 
in numerical simulations of the first stars. We will look 
at the improvement in cluster identification with more 
chemical elements, particularly in the limit of small n* . 
The clustered abundance signatures will provide unique 
insight into the most ancient star clusters. These signa- 
tures are signposts of the chemistry immediately before 
the onset of star formation, untainted by mass transfer 
in close binary pairs or incomplete mixing anomalies. 

Finally, it is an extraordinary fact that we can probe 
back to the first billion years from observations of the 
local universe. We can say with absolute certainty that 
stars existed at this time. These were responsible for the 
first chemical elements (Ryan- Weber, Pettini & Madau 
2006) and for reionizing the fog of hydrogen that per- 
meated the early Universe (Fan et al 2002). Precisely 
when the first star clusters formed is unknown. It seems 
likely, however, that gas was able to fragment at very 
high density even at primordial abundance levels (Clark 
et al 2008). It may be possible to directly probe these 
environments in an era of the Atacama Large Millime- 
tre Array and the James Webb Space Telescope. But we 
believe that some of the most important insights, par- 
ticularly with regard to progenitor yields, will undoubt- 
edly come from near-field cosmology. To this end, it 
will be necessary to equip the next generation of ELTs 
with wide-field multi-object spectrographs that operate 
at high spectroscopic resolution (R>20,000). 
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TABLE 1 
Log of the expected numbers of stars as a function of V 

MAG for an old, METAL POOR DWARF SPHEROIDAL (SEE FiG. [2J 
WITH 10^ STARS (stellar MASS 1^ 3 X 10® Mq) AT DISTANCES OF 

10, 30 AND 100 KPC (coL. 1). Columns 2-5 are the star 

COUNTS FOR A SALPETER IMF IN 4 METALLICITY BINS; COLUMNS 

6-9 ARE THE COUNTS FOR A KrOUPA IMF IN THE SAME BINS 

DETERMINED FROM THE MDF IN FiG. 2. THE 4 METALLICITY BINS 

ARE [Fe/H] = (-5:-4), (-4:-3), (-3:-2), (-2:-l); DASHES INDICATE 

THAT NO STARS ARE EXPECTED. 
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APPENDIX 
GROUP FINDING IN AN ABUNDANCE SPACE 
In the EnLink clustering scheme, each group is characterized by a maximum density pmax and a minimum density 



Pmin, and these are used to define its significance S as 

ln(pi, 



S = 



ln(pn 



0-l„(p) 



(Al) 



where crin{p) is the standard deviation associated with the density estimator and is a constant for a given dimensionality 
and qden- 

For Poisson-sampled data the distribution of density as estimated by the code using the kernel scheme is log-normal 
and the variance satisfies the relation 

' (A2) 



C^ln(p) = Y Vd||W^|li/9dc 



where qden is the number of neighbours employed for densit y estimation, Vd the v olume of a d-dimensional unit 



hypersphere and HW^Hi the L2 norm of the kernel function (Sharma & Johnston||2009). For qden — 6 and d = 2, o'in(p) 
evaluates to 0.4714. 

Thus Enlink has two free parameters: (i) the significance threshold of the group St, and (ii) the number of nearest 
neighbours qden used for density estimation. The variable qden should be set to less than the minimum desired size of 
the groups because the density is smoothed over a scale of qden nearest neighbours such that the significance and hence 
the probability of detecting groups of size less than qden is drastically reduced. Additionally, qunk = miii(10,(7(ien — 1) 
neighbours are used for linking the groups which means groups whose density peaks lie within qunk nearest neighbours 
of each other cannot be separated. Since we are interested in identifying groups with less than 10 data points, we set 
Qden — 6; the case for n^ < 6 is discussed in the next section. 

Our initial analysis is for data points with [Fe/H] < -2.5. This includes a crowded, high-density region of the 
abundance plane which skews the clustering analysis considerably. However, this may be an important regime if the 
first chemical elements arise from pair instability supernovae (Karlsson et al 2008) . The significance of spurious groups 
is distributed almost as a Gaussian distribution and hence the significance thr eshold St is used to suppress these 
groups. An improved version of the empirical formula ( Sharma fc Johnston|2009 1 for the expected number of spurious 
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Fig. 1. — Four realizations of the beta distribution in equationis] The vertical axis log^ can be trivially rescaled to match the observed 
scatter in [X/Fe]. The abundance scatter declines quadratically (m linear space) as expected (see §3.1). 



groups, valid for 5t > 1, is given by 



G{S > St)=[1- erf 



fSTfdA\ OAn 



{—) 



<7dc 



(A3) 



Al 



and 



A2 



where fdq — 0■5^yd{l — 2.3/q^en) is a small correction term and n^, is the number of data points.^ In Figs. 

we plot the significance distribution 5 of identified groups for data sets with different values of n* and 7 as labellec 

on the plots. 

We set St such that the expected number of spurious groups (false positives) with 5 > 5t is about 2 in all data sets 
which we calibrate from the beta models. St is set to 1.0, 2.35, 3.5 and 4.35 for data with n* as 100, 300, 1000 and 
3000 respectively. This number is increasing because, for a fixed qden and data dimensionality, the number of spurious 
groups due to Poisson noise increases linearly with the total number of data points n* . 

Each panel shows the outcome of five EnLink analyses: the cluster-less beta model (solid black line); the simulated 
models Al and Bl having unrealistic abundance errors 0.01 dex shown as solid red and blue curves respectively; the 
simulated models A5 and B5 having abundance errors 0.05 dex shown as dotted red and blue lines respectively. In each 
panel, the mean number of identified groups G along with its dispersion ac is also given. The mean and dispersion 
were calculated using 100 random realizations of each data type. The plotted distributions are also averaged over 
these 100 random realizations. We also provide a measure of the statistical significance of detecting clusters in a data 

'' Since the distribution in S for spurious groups is roughly normal (equation 15), S can be loosely interpreted as the statistical z-score 
for finding a group per unit data point. 
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-3 -2.5 

[Fe/H] 

Fig. 2. — Metallicity distribution function (MDF) for the stochastic chemical evolution models presented in Figs, [s] and [4] The main 
plot shows the log of the MDF and the inset is the more conventional linear MDF. The quantity /« is the fraction of stars that fall within 
each [Fe/H] bin (1 dex). The dashed curve shows the gradient defined by logd/»/d[Fe/H] = 1 which illustrates that the fraction of stars 
in each [Fe/H] bin increases roughly by a factor of 10 as the metallicity increases up to the turnover at [Fe/H] Ri —1.5. 

set as follows. If Gb is the number of groups as predicted by the smooth beta model, the statistical significance of 
clustering in a model is given by 

Cs = ^ ^} ^^^^ (A4) 



The third column shows the value of Cs for each data set. 

It can be seen in Fig. |A1| and |A2| that the expected number of spurious groups for the beta models are nearly 
independent of the value of 7, which is a consequence of the adaptive metric scheme. Specifically, EnLink uses the 
concept of a locally adaptive metric, which is used for calculating densities and nearest neighbours of data points (a 
refinement over earlier developments by Ascasibar & Binney [2005]) to increase the efficiency of detecting clusters in 
multi-dimensional spaces. If instead clustering is performed using a Euclidean metric on raw data, the significance 
distribution of spurious groups due to Poisson noise is found to vary with the distribution of points in the abundance 
space, e.g., the beta models with different values of 7. 

In the limit of few data points, the locally adaptive metric scheme is equivalent to using a metric which is given 
by the inverse of the dispersion along each dimension. As a check we also performed the analysis using this simpler 
scheme and found equivalent results, thereby demonstrating the robustness of our group-finding analysis. For the 
sake of accuracy, we use the beta models to evaluate the significance of clustering, but strictly speaking this is not 
required and any Poisson sampled cluster-less data would also suffice. In fact, the empirically derived formula given 
by Equation (A3 1 can also be used directly to predict the number of spurious groups. 



Next we compare the parameter Cs for the different cases. It can be seen that as n^ and 7 increase, Cs increases 
also. First we look at cases with unrealistic measurement errors of 0.01 dex. It is clear that the 7 = 2.5 models 
are nearly undetectable for all values of n* and closely resemble the beta models. For 7 = 2.0, a minimum value of 
n^ = 300 is needed for Cs ^ 1- For lower values of 7, signatures of clustering are visible with as few as 100 points. 
Increasing the abundance errors to 0.05 dex significantly affects our ability to detect clusters. When sufficient number 
of data points are present, the A5 and B5 models perform better for 7 = 1.5 and 2.0 as compared to 7 = 1. This 
is because the clusters in these cases are more in number, are spread over a larger area, are less crowded, and hence 
easier to detect. 

Since clustering is most prominent for lower values of [Fe /H], we also investigated data sets with a lower metallicity 
cut-off, [Fe/H] < -3.0. These results are shown in Figs. A3 and A4 Th e pa ram eter Cg is found to increase in general 
for all cases, unlike what is seen in the [Fe/H] < -2.5 analysis (Figs. Al and A2). It is striking how much better 
EnLink performs on these simulations which is to be expected given the broader intrinsic dispersions of [r/Fe] in the 
models. In lig ht of these results, another way to look at Fig. [4] is the significance S of clustering over the abundance 
plane. In Fig. |A5[ we show the S contours for the four different values of 7 with lower significance regions to high 
[Fe/H] . Clustered abundance data points near the mean value of [r/Fe] are more likely at low [Fe/H] for low values of 
7. Clustered abundance data points away from the mean are favoured by higher values of 7. 
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Fig. 3. — The results of the stochastic chemical evolution modelling in §5 for a element Ca compared to [Fe/H]. The five models from 
top to bottom are: (i) no clustering, (ii) 7 = 2.5, (iii) 7 = 2.0, (iv) 7 = 1.5, (v) 7 = 1.0. The left and right hand panels are two different 
realizations of the same 5 models. The clustering in abundance space becomes very apparent at low values of 7. High values of 7 are barely 
distinguishable from the "no clustering" distribution in (i). Note also that the scattering at a fixed value of [Fe/H] decreases dramatically 
with decreasing values of 7. The intrinsic dispersion within individual clusters (0.01 dex) is much less than expected in real data; more 
realistic models with fewer data points and increased measurement error are presented in Fig. Is] 
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Fig. 4. — The results of the stochastic chemical evolution modelling in §5 for r-process element (specifically Eu) compared to [Fe/H]. 
The vertical extent is 4 dex in fr/Fe] , i.e. a fourfold increase over [Ca/Fe] in Fig. at (The vertical and horizontal axes are presented with 
the correct aspect ratio in Figs.pjandJT]) The five models from top to bottom are: (i) no clustering, (ii) 7 = 2.5, (iii) 7 = 2.0, (iv) 7 = 1.5, 
(v) 7 = 1.0. The left and right hand panels are two different realizations of the same 5 models. The clustering in abundance space becomes 
very apparent at low values of 7. High values of 7 are barely distinguishable from the "no clustering" distribution in (i). Note also that the 
scattering at a fixed value of [Fe/H] decreases dramatically with decreasing values of 7. In order to make clustered points more circular, 
the intrinsic dispersion within individual clusters is A[Fe/H]=0.01 dex and A[Eu/Fc]=0.035 dex. 
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Fig. 5. — The results of the stochastic chemical evolution modelling in §5 for the 2D space C([Fc/H], [r/Fe]). The top two panels are 
repeated from Fig.|4|for which 7 = 1.5 (left) and 7 = 2.0 (right), both with a high mass cut-off Mmax = 5.0 X 10* Mq. The bottom two 
panels use 7 = 1.5 (left) and 7 = 2.0 (right) but with a high mass cut-off A/max = 10 X 10® M0, twenty times higher than was used for the 
upper panels. Note how the high mass cut-off at high 7 mimics the behaviour of a lower value of 7: the distribution is everywhere flattened 
in the vertical direction, and groupings are seen to extend to lower [Fe/H]. This behaviour is particularly apparent in the left-hand figures. 
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Fig. 6. — (Top) Cumulative fraction of stars brighter than a given stellar absolute magnitude Afy for a Salpcter (lower) and Kroupa 
(upper) IMF. The high and low stellar mass cut-offs for both IMFs are 0.1 Mq and 100 M0. The stellar population has a mass of 3 X 10® 
Mq and we use the Padova isochrones (see text) for a 12 Gyr old single burst, metal poor population. (Bottom) Integrated light from this 
same population brighter than a given stellar absolute magnitude My. 
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Fig. 7. — The same models presented in Fig.Wlbut without the fourfold compression in the [r/Fe] axis. The top figures are for 7 = 1.5 
and 7 = 2 and have an unreahstic intrinsic scatter of 0.01 dex. The bottom figures are repeated but with an intrinsic scatter of 0.1 dex. 
The effects of clumping are clearly seen in both distributions. 



18 



Bland-Hawthorn, Karlsson, Sharma, Krumholz, & Silk 



CD 



CD 




2.0 



1.0 



0.0 



-1.0 



-2.0 





^> 


+ 




h . 




+^ 




A> : 


V = 18 


+ + . 


a = 0.1 


+ + 


No clustering 


— , — 1 — , — 1 — , — 1 — , — 1 — , — 1 


. a- 1 



+ 



++ 



r-H- ■-■+ 



% 



^ 



^ 






^ V = 22 
+ a = 0.05 
Y =1.5 



I 1 1 I I I I i_ 




V = 22 ym+^ ^ 

+ -H"+ + 

a = 0.05 



+ -U++- 



+^^+ +)-. 



+ , 4- + 

No clustering ^_J 



+ H^ 



-5.0 -4.0 -3.0 -2.0 
[Fe/H] 



-5.0 -4.0 -3.0 -2.0 
[Fe/H] 



Fig. 8. — A simulation of a targetted study of a nearby dwarf galaxy on an Sm (top) and SOm (bottom) telescope. The data points are 
drawn from Fig. Wlfor a galaxy with a stellar mass of 3 X IO^Mq at a distance of 30 kpc (see Tablenl: (left) 7 = 1.5 (right) no clustering. 
The simulated errors are 0.1 dex in the top figures and 0.05 dex in the bottom figures. There is evidence of clustering at [Fc/H] < —3.0 
from a sample of 10 stars on an 8m class telescope; the clustering is easily detected in the SOm telescope experiment. 
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Fig. 9. — A simulation of a targetted study of a nearby dwarf galaxy on an Sm (top) and SOm (bottom) telescope. The data points are 
drawn from Fig. Wlfor a galaxy with a stellar mass of 3 X IO^Mq at a distance of 30 kpc (see Tablenl: (left) 7 = 2.0 (right) no clustering. 
The simulated errors are 0.1 dex in the top figures and 0.05 dex in the bottom figures. There is evidence of clustering at [Fe/H] < —3.0 
from a sample of 10 stars on an 8m class telescope; the clustering is easily detected in the SOm telescope experiment. 
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([Co/Fe],[Fe/H]) [Fe/H]<-2.5 
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Fig. A1. — The results of EnLink group finding applied to the models presented in Fig.|3]at a metallicity cut-ofi^ [Fe/H] < —2.5. The 
models represent the 2D space C([Fe/H], [Ca/Fe]) where the four rows show results for n* = 100,300,1000,3000 data points. The four 
columns present results for 7 = 1.0,1.5,2.0,2.5. In each panel, there are five distinct significance distribution functions G{S) (equation 
15); note that the vertical range increases as n* increases (see §5.1). The black histogram is the EnLink analysis of the control sample 
generated by the beta distribution in equation [3] which, by definition, has no clustering. The solid blue and red histograms (Al, Bl) are 



the analysis of the A and B realizations respecHvely (0.01 dex uncertainty) shown in Fig. ^ the dotted histograms (A5, B5) repeat the 
analysis for an abundance uncertainty of 0.05 dex. The statistical means and uncertainties (in brackets) for all distributions arc given in 
the insets; the third value is the Cg statistic (see text). 
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Fig. A2. — The results of EnLink group finding applied to the models presented in Fig. 14] at a metallicity cut-ofi^ [Fe/H] < —2.5. The 
models represent the 2D space C{[Fc/H], [r/Fe]) where the four rows show results for n, = 100,300,1000,3000 data points. The four 
columns present results for 7 = 1.0,1.5,2.0,2.5. In each panel, there are five distinct significance distribution functions G{S) (equation 
15); note that the vertical range increases as n* increases (see §5.1). The black histogram is the EnLink analysis of the control sample 
generated by the beta distribution in equation [3] which, by definition, has no clustering. The solid blue and red histograms (Al, Bl) are 



the analysis of the A and B realizations respecHvely (0.01 dex uncertainty) shown in Fig. W] the dotted histograms (A5, B5) repeat the 
analysis for an abundance uncertainty of 0.05 dex. The statistical means and uncertainties (in brackets) for all distributions are given in 
the insets; the third value is the Cg statistic (see text). 
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Fig. A3. — The results of EiiLink group finding applied to the models presented in Fig 
for n» = 100 and 300 data points. The four columns present results for 7 = 1.0, 1.5, 2.0, 
effects of clustering are now more apparent, even in the limit of small statistics. 
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A5. — The statistical significance distribution S (equation 13) for the four clustered models presented in Fig. 14] The contours are 
(from right to left): 5 = 2,4, 6, 8, 10, 12. Note, for example, groupings of data points are more likely near the mean value of [r/Fe] at low 
[Fe/H] for low 7. Clustered abundance data points away from the mean are favoured by higher values of 7. 
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Fig. A6.— The results of EnLink group finding applied to the 3D space C{[Fe/H], [Ca/Fe], [r/Fe]) for [Fe/H] < -3.0. The two rows 
show results for N = 100 and 300 data points. The two columns present results for 7 = 1.5, 2.0. The histograms are defined in Fig. |A2| 
The simulated measurement errors are now 0.05 (red), 0.10 (blue) and 0.20 (purple) dex. The effects of clustering are apparent, even in 
the limit of small statistics and with larger measurement errors. 



